"Birds in the Clouds": Adventures in Data Engineering
نویسندگان
چکیده
Leveraging their eBird crowdsourcing project, the Cornell Lab of Ornithology generates sophisticated Spatio-Temporal Exploratory Model (STEM) maps of bird migrations. Such maps are highly relevant for both scientific and educational purposes, but creating them requires advanced modeling techniques that rely on long and potentially expensive computations. In this paper, we share our experience porting the eBird data pipeline from a physical cluster to the cloud, providing a seamless deployment at a lower cost. Using open source tools and cloud ”marketplaces”, we managed to divide the operating costs by a factor of 6, saving hundreds of thousands of dollars.
منابع مشابه
Detection of some Tree Species from Terrestrial Laser Scanner Point Cloud Data Using Support-vector Machine and Nearest Neighborhood Algorithms
acquisition field reference data using conventional methods due to limited and time-consuming data from a single tree in recent years, to generate reference data for forest studies using terrestrial laser scanner data, aerial laser scanner data, radar and Optics has become commonplace, and complete, accurate 3D data from a single tree or reference trees can be recorded. The detection and identi...
متن کاملLaminar Flame Speed Prediction in Lean Mixture of Aluminum Dust Clouds by Considering the Effect of Random Distribution of Particles in Two-dimension
In the present study, the effect of random distribution of reactants and products on laminar, 2D and steady-state flame propagation in aluminium particles has been investigated. The equations are solved only for lean mixture. The flame structure is assumed to consist of a preheat zone, a reaction zone and a post flame zone. It is presumed that in the preheat zone particles are heated an...
متن کامل3D Detection of Power-Transmission Lines in Point Clouds Using Random Forest Method
Inspection of power transmission lines using classic experts based methods suffers from disadvantages such as highel level of time and money consumption. Advent of UAVs and their application in aerial data gathering help to decrease the time and cost promenantly. The purpose of this research is to present an efficient automated method for inspection of power transmission lines based on point c...
متن کاملEffects of Structure and Partially Localization of the π Electron Clouds of Single-Walled Carbon Nanotubes on the Cation-π Interactions
A C102H30 graphene sheet has been rolled up to construct Single-Walled Carbon NanoTube Fragments (SWCNTFs) as parts of armchair carbon nanotubes by computational quantum chemistry methods. Non-covalent cation-π interactions of the Na+ cation on the central rings of SWCNTFs have investigated. The binding energies of the Na+-SWCNTF complexes versus ...
متن کاملتاثیر نوع ابرهای پایین جو بر میزان دقت شبیه سازی رواناب در مدل SWAT
Introduction: Patterns of spatial and temporal rainfall impact on runoff and outlet hydrograph (Cordery, 1993; James, 1994). Results of different studies have clarified that simulation by using diverse rainfall data could increase the reliance of results. These were much more sensible in which areas encounter with data scarcity (Mello et al., 2008; Bekiaris et al., 2008). Rainfall properties in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1710.08521 شماره
صفحات -
تاریخ انتشار 2017